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GENETIC REFERENCE MATERIALS 



Field of the foventioq ; 



The invention relates to methods of producing and maintaining generic reference 
materials for use as controls in genetic testing. 

Background and Prior Art known to the Applicant 



In clinical genetic diagnostics,, it is of utmost importance that test results are accurate. In 
pre-natal diagnostics for example, false-positive results may lead to the termination of a 
normal foetus and a false-negative result may lead to the birth o£ or failure to diagnose an 
affected child. In clinical settings, results of genetic tests may form the basis for clinical 
2$ intervention, and it is essential therefore that the results obtained are soundly based. 
Increasingly, genetic tests are also used to identify individuals in a population who may 
have a {we-disposduon to disease states. Knowledge of the pre- disposition may lead to 
effective prophylactic measures, either through clinical intervention or adjustment of 
lifestyle factors. 



To ensure the reliability of such genetic tests, the use of genetic reference materials allows 
positive or negative controls to be present in each test, thus validating the results. These 



09-DEC-E003 11:55 FROM: IPE1 LTD. 



01603 E5 1125 



TO: +01633 814444 



P.035'0c 





reference materials consist of DNA sequences, which thus far have been provided by one 
of three methods: 

1 . PCR (polymerase chain reaction) product 
s 2. Plasmid-cloned PCR product 
3. Human genomic DNA* 
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15 



20 



25 



30 



Whilst the use of PCR-produced DNA may seem attractive for production of genetic 
reference material using the sequence of interest, it is associated with a number of 
disadvantages. The extremely large entity of DNA produced by such a technique (way 
in excess of that found in a typical patient sample) poses a significant eotrttounation risk 

is unstable, leading to a 

Jack of Precision in its use. Another problem that occurs is that the reference genetic 
sequence produced by such a technique is produced to isolation, U. without the normal 
background non-target DNA that .would be found in a patient-derived sample. Thus, the 
nature of such stmterts is con^d^ly du^t ftom the test samples with the risk of 
arte&cts in the assav. : - 



. • - • - 
> ■ ■ 



Rationed PCR product may also be produced by introducing the human genetic 
reference sequence sequent in a plasmid into an organism such as Recti Whilst such a 
mechanism may be more stable man raw PCR-derived product, there still remains a 
ination risk due to the higb level of DNA material produced and questions 
regarding long-term stability. This lack of stability may lead, among other things, to a 
loss in quality or quantity of reference DNA produced by such a method due to e. S . 



H 
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The third approach, me use of human genomic DNA is not associated with the problems 
of contanunation. and instability seen in the first two methods, but has a number Of 
disadvantages of its own. Firstly, any human genomic DNA sample, in the form of 
human cells, will be rapidly consumed in use, and so an 'immortalised' ceil line is 
required. This immortalisation process is time consuming and will usually involve the 
handling of live Epstein-Bar Virus with the associated potential health risks to the 
operator. The production of an immortalised cell line in this way also requires the use of 
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fresh patient-detived blood, which is often difficult to obtain. All three of these 
approaches require, of course, full informed consent from the patient from which the 
material is derived. 

5 It is an object of the present invention to provide an alternative source of genetic reference 
material that can be used to standardise genetic testing. 

Summary of the Invention 

10 In the summary of the invention that follows, the term "human genetic reference 
sequence" comprises a human DNA sequence containing at least one genetic variant 
whose presence in the DNA of a human subject is indicative of a pathological condition* a 
predisposition to & pathological condition, or a predisppsition to an adverse reaction to 
external stimuli. The said genetic variant comprises a changes insertion or deletion of one 
. . w or more bases with respect to the most common sequence in a human .population, and 
. includes single nucleotide polymorphisms (SNP^ mutations, base or sequence insertions* 
base or sequence deletions and a change in tandem repeat length. The reference sequence 
is characterised in thai its total length is at least 35 bases, and does not exceed 30 
kilobases. For some applications, the minimum length of the reference sequence may 

to need to be more than 1 00 bases* to allow detection in an assay in which the reference is to 
be used. A length of 500 bases will be sufficient for almost all applications, but the 
skilled addressee may readily determine an appropriate length by routine experiment, and 
without further inventive thought 

2& The invention provides a genetic reference standard comprising at least one human 
genetic reference sequence cloned into a non-mammalian animal cell line. 

Preferably, the animal cell line is an avian cell line, more preferably a chicken (Gallus 
gallus) cell line and most preferably the chicken DT40 cell line. 

According to any aspect of the invention the at least one human genetic reference 
sequence is cloned into a dispensable region of the cell's genome. 
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Also according to any aspect of the invention, the at least one human genetic reference 
sequence is cloned into a. non-expressed region of the cell's genome. 

Also according to any aspect of the invention, the cloned cell line is diploid with respect 
s to the human genetic reference sequence. 

Also according to any aspect of the invention, the at least one human genetic reference 
sequence is a plurality of human genetic reference sequences. 



10 Also according to any aspect of the invention, the or each human genetic reference 
. sequence is not a functional chromosome. 



There is further provided a method of detecting a genetic variant in a sample containing 
human DNA comprising: 
JS performing a test, responsive to DNA sequence, on said sample; 

...... performing the same test on a reference sample embodying the genetic variant W be 

. detected: 

comparing the test results obtained from said sample and said reference sample to 
. determine the presence or absence of said genetic variant; 
20 characterised in that said reference sample is a genetic reference standard as described in 
any aspect of the invention above. 



The inv 



ion will be described by reference to the accompanying drawings in which: 



25 BrirfDtwaription n f the Drawm a* 



Figura 1 illustrates a targeting vector suitable for introducing a human genetic reference 
sequence into a suitable cell line. 

30 KsWC 2 illustrates a targeting vector suitable for introducing a human genetic reference 
sequence into a suitable cell line, together with the range of cells so produced; and 
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Figure 3 illustrates a targeting vector and a scheme by which heterozygous cell lines may 
be produced. 

Cloning »NA fragments into a heterologous exikaryotic cell line in this way acts as an 
intermediate form of genetic reference material. Being gnomic-based, the material 
would be both stable, and not present a contamination risk. Furthermore, by not 
containing human sequences, the background DNA would be less likely to cross-react in 
any human testing protocol. Patient blood would not be required and the r~ J "~ 
pathogenic human virus would also be avoided. 



10 



is 



to 



The W ired human genetic reference sequence may be derived from a patient possessing 
the genotype associated with me test. Alternatively, the sequence may be obtained from 
any unaffected individual and artificially modified such that the DNA sequence matches 
that of the mutant or rare form. Thus, the PGR product being cloned may derive from a 
buccal swab and need not even be patient-derived. Also, the sequence may be denved 
from a normal human cell line, and engineered to introduce the required variant**). The 
use of a cell line derived from an anonymous donor In this way would obviate the 
requirement for informed consent from a known donor. Furthermore, if the required 
human genetic reference sequence were of a sufficiently short length, then it could also be 
A from Imowledee of to sequence. 



Thus, 



recombination 



ice material may be produced by the cloning of one or more 
(heterologous) nonrmammalian cell line. The use of 

of cell lines with, a controlled 



^1 









number 



targeted 



location withi 



so 



la order to illustrate typical applications for the invention, we consider some of the many 
genetic screens for which the invention can provide reference materials. Genetic 
screening may be carried out for a number of ends, such as: 

1 . Screening for mutations that cause rare diseases; 

2. Screening for DMA variants which predispose to common diseases; and 
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3. Screening for DNA valiants which effect drug response. 
Examples of each of these three types are given below. la the partial reference sequences 
quoted, the altered bases are indicated by the use of lower case letters. 

5 /- Gene mutations or variants that cause genetic disorders 



10 



qystic Fibrosis 

Cystic fibrosis is a common (recessive) monogenic disease in European populations, 
occurring 1 in around 2500 live births. There are many causative mutations* but in the UK 
population S> mutations account for around 83% of ths CF mutations [DF508 - 75.3%; 
G551P - 3*08%; G542X - 1.68%; 621 * I (G > T) - 0.93%; 1717-1(G> A) - 0.57%; 1898 
+ 1)(G > A) - 0.46%; RUTH - 0.46%; N1303K 0.46%; R553X - 0.46%] 



15 



20 



25 



SO 



DF508 DNA sequence (TTT deletion): 

TCAGTTTTCCTGGATTATGCCTGGCACCAT^^ 

TCCTATGATGAATATAGATACAGAAGCGTCATCAAAGCATGCCAACTAGAAG 
AGGACATCTCC 



1(H) Sickle Cell Anaemia 

Sickle cell anaemia is an inherited blood disorder characterized primarily by chronic 
anaemia and periodic episodes of pain. Sickle cell anaemia is an autosomal recessive 
genetic disorder caused by a defect in die beta-haemoglobin gene (HBB). Hie disease 
occurs in about 1 in every 500 African-American births and 1 in every 1000 to 1400 
Hispanic- American births. Although several hundred HBB gene variants ate known, 
sickle cell anaemia is most commonly caused by tiie hemoglobin variant Hb S. In this 
variant (E6~V) the amino acid valine takes the place of glutamic acid at the sixth amino 
acid position of the HBB polypeptide chain, 

NCBI SNP CLUSTER ID: rs334 

ACCTCAAACAGACACC^TGGTGCACCTGACTCCTGa/tGGAGAAGTCTGCCGTT 
ACTGCCCTGTGGGG 



I(Hi) Myotonic dystrophy 
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Myotonic dystrophy is a dominantly inherited disease in which the muscles contract but 
have decreasing power to relax. With this condition, the muscles also become weak and 
waste away. Unaffected individuals have between 5 and 27 copies of an 'AGC/CTQ 
triplet repeat 1 hi the 3* untranslated region of a protein kinase gene. Myotonic dystrophy 
patients who arc minimally affected have at least SO repeats, while more severely affected 
patients have an expansion of up to several lriloba$e pairs* 



U Gene variants that predispose to disorders 

10 U(i) Factor 2 (Prothrombin) 

This is a G-to-A transition variation at position 20210 in the 3* untranslated region of the 
prothrombin gene that is associated with elevated plasma prothrombin levels and an 
increased risk of venous thrombosis, The minor (A) allele is present at a frequency of 
around 1%, so that individuals heterozygous for the variant occur at a frequency of around 

is 2%. Individuals homozygous for the variant occur very rarely at a frequency of around 1 J 
ral0,0P0. 

NCBI SNP CLUSTER ID: ts1799963 

GTTCCCAATA AAAGTGACTCTCAGCg/aAGCCTCAATQCTCCCAGTGCTATTC 

20 

n(U) Factor S 

This is a G-to-A variant at position ' 1691' causing an Argwine to Glutamine amino acid 
substitution. Again, this is associated with risk of venous thrombosis. Individuals 
heterozygous for the variant occur at a frequency of around 5%, and individuals 
25 homozygous for the variant occur at a frequency of around 1 in 1 650. 
NCBI SNP CLUSTER ID: rs6025 

TCTGTAAGAGCAGATCCCTGGACAGGCg/aAGGAATACAGGTATTTTGTC 
CTTGAAGTAA 



so n(iU)Hereditary haetnochromaiosis 

Hereditary haemochiomatosis is a common (recessive) iron-overload disorder. There are 
two common mutations: C282Y and H63D. The C282Y mutation results from a G-to-A 
transition at nucleotide 845 of the HFE gene (845G to A) that produces a substitution of 



|Qp88^6^C^<ee£tl^Igsg5g 
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8 





m 




it 



replace 



of these 



S3 in the HFE protein. Individuals 
ound heterozygous have an increased ri 



frequency 



0.07 and H63D has && allele frequency of around 0,14. 



H63D DNA sequence: 

GACCAGCTGTTCGTGTTCTATGATc^^ 



C282Y DNA sequence: 

CGCTGGGGAAGAGCAGAGATATACGTg/aCCAGGTGGAGCACCCAGGCCTGGA 
TCAGCC 



13 



20 



2$ 



3Q 



- Gene variants affecting drug response 



» * 



111(1) Thiopurine S-meihyltrantferqse (TPM1) 

TPMT gene variation affects an individual's ability to metabolize the thiopurine class of 
drugs. Studies have shown that one in 300 individuals (0.3%) have low to absent levels of 
TPMT enzyme activity (homozygous recessive)* 1 1% have intermediate levels of enzyme 
activity (heterozygous) and 89% have normal to high levels of enzyme activity 
(homozygous normal). TPMT testing allows physicians to identify, prior to initiating 
therapy, patients who are at risk for developing acute toxicity to the thiopurine class of 
drugs. 

Four variant TPMT alleles have been identified (TPMT*!, TPMT*3A, TPMT»3B S 
TPMT*3C)» which account for -80% of Caucasians with low or intermediate TPMT 
activity. TPMT*2 contains a G->C substitution at nucleotide 238, while TPMT* 3 A 
contains two nucleotide transition mutations (G460A and A719G). TPMT*3B has only 
G460A, while TPMT* 3 C contains only A719G. The Caucasian allele frequencies me: 
TPMT*2 - 0,5%; *3A - 4.5%; *3C - 03% 
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This range of examples of genetic screens will allow the skilled addressee to identify 
other potential applications for the current inventions. Whilst only short sequences have 
been identified in the above examples, longer sequences omtaining the same genetic 
variants may Teadily be constructed by reference to the published sequence data, should 
these be required in any assay using the standard. The genetic variant (e.g. SNP, 
mutation, deletion, insertion etc.) may conveniently be located in any position in the 
human genetic reference sequence, as required for any subsequent assay. However, for 
some assays, it is particularly advantageous to locate the variant towards the centre of the 
human genetic reference sequence, Preferably also, the human genetic reference 
sequence is not a functional chromosome (i.e. unable to stably replicate independent of 
the host genome) and most preferably non-centromeric. 

Preferably, the human genetic reference sequence may be cloned into a non-mammaUan 
eukaryotic cell to provide a genomic DNA background that would not be likely to cross 
react with the generic test- Avian cell lines are particularly suitable for this purpose, as the 
genomic DNA is substantially different to that of humans. Of these line, cells from 
chicken (Gallus gallus) are an especially advantageous heterologous host, as not only is 
chicken genomic DNA substantially different to that of humans, but also has a similar size 
(ea 1.2Gigabaaes for chicken compared with 320igabases for humans). 



The chicken B-cell line DT40 (Baba & a/,Virology 144:139-151, 1985) is a particularly 
effective cell line for this purpose, as it is highly recombinstion-efEcient and avoids the 
likelihood of multiple integrant copies and instability associated with random integration* 
There is also a considerable existing literature on techniques for genetic manipulation of 
DT40. Using the cells recombination machinery a single DNA molecule may be 
integrated into a defined position by the use, e.g> of targeting arms. Such techniques will 
be illustrated in the embodiment below. 



In order to mimic the situation that may he encountered in human patient-derived 
so samples, both homozygous and heterozygotes may be produced as desired 

In order to facilitate the construction of a number of manipulated cell lines for differ ent 
genetic tests, a single targeting construct, given the teaching of this disclosure, may be 
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constructed that would serve for all required DNA fragments. Alternatively, a pair of 
constructs with- identical targeting arms but different antibiotic resistance genes (see 
below) may be used for the production of beterozygotes* Finally, by using multiple 
recombination sites, single cell lines may be produced carrying multiple reference 
fragments. This approach may be facilitated by the use of mutated LoxP sites, to allow 
tbe re-use of antibiotic resistance markers. 



Embodiment 1 

is This embodiment illustrates a way in which the invention may be worked to create a 
genetic reference standard by insertion of a human genetic reference sequence into a 
dispensable region of the chicken DT40 cell line. 

: • . 

t * * m 

Figure 1 shows, diagrtuamatically, a targeting vector that may be used to realise the 
*s, current invention. The vector, generally h comprises the ]J|luescript sequence 2, of use 
in the bacterial stages of construction of 5 the targeting plasmid, a left targeting aim 3, the 
human DNA fragment 4 to act as the reference material, an antibiotic resistance gene 5 
and a tight targeting arm 6, The targeting aims cany chicken DNA sequences for 
homologous recombination, enabling the integration of the human sequence and the 
I 2D antibiotic resistance gene into a specific site of the DT40 genome. The antibiotic 

resistance gene 5 may be flanked by mutant LoxP sites 7* 8. In the example shown in 
Figure 1 3 there is a LoxP RE mutant 7 and a LoxP LE mutant 8. These LoxP sites enable 
| the removal of the antibiotic resistance gene by use of the enzyme CRE Reoombinase, 

| once the vector sequences are integrated into the chicken genome* this technology is 

| 25 described in Arakawa et al, BMC Biotechnology 2001, 1:7. Situating the mutant LoxP 

sites flanking the antibiotics resistance gene enables the subsequent removal of the 
antibiotic resistance gene, facflitating the re-use of that antibiotic selection marker in any 
further gene-targeting events in the modifiedreeil line. The targeting vector 1 also has a 
unique restriction enzyme site 9 to enable the vector to be linearised by cleavage with, a 
36 restriction enzyme, prior to introducing it into the host DT40 cell lines by electroporation- 
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£l targeting arm 3, 6 would be 2-Skilobases in size. In human DNA the 
fragment would typically be around IkDobase in size, and the variant base 
tied towards the centre of the fragment 



In this example, the human genetic reference sequence 4 is to be inserted in a, dispensable 
region of the DT40 genome. A suitable dispensable region is the genes coding for the 
high mobility group A (HMGA) family of nonhistone chromosomal proteins, encoded by 
the two related genes, HMGA1 and HMGA2. It has been shown by Beitzel and Bushman 
("Construction and analysis of cells lacking the HMGA gene femily/ 5 Nucleic Acids Res. 
2003 Sep l:31(l7);502S-32.) that the HMGA gene femily is dispensable for growth in 
DT40 cells. They found no significant changes in the activity of approximately 4,000 
chicken genes following deletion of either or both HMGA I or HMGA2. They concluded 
that the HMGA proteins are not strictly required for growth control in DT40 cells. This 
region of the DT40 genome is thus a suitable target for insertion of the human genetic 
reference sequence 4. Others may readily be found by the stalled addressee, by reference 
to the literature (see eg. Li Y, Strabler JR, Dodgson JB. "Neither HMG-I4anor HMG-17 
gene function is required for growth of chicken DT40 cells or maintenance of DNasel- 
hypersensitive sites." Nucleic Acids Res. 1997 Jim IS; 25{2):2s3-8) or sequence 
databases. 



Thus, the left targeting aim 3 and right targeting arm 6 may be constructed by reference to 
the published sequence of the HMGA gene femiiy. The plasmid 14 may therefore be 
constructed using the well-established pBluescript constructs using these targeting arms 3 
and 6* and the human genetic reference sequence 4. A suitable antibiotic resistance 
marker 5 will be evident to the skilled addressee and would include, for example, 
Neomycin, Puramycm or Plastiddin. These antibiotic resistance genes may be driven by 
me chicken B-actm promoter. Detailed protocols for construction of the plasmid will be 
immediately evident to the skilled addressee given this teaching. 

Following construction of the plasmid 1, the skilled addressee will he readily able to 
transform the host DT40 cells by, for example, linearisation of the plasmid 1 with 
restriction enzyme specific for the restriction she 9 and introduce the linearised construct 
into DT40 by e.g. eiectroporataon. 
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gmhndimeatlt 2 



Ms embodta** demons*^ haw the invito may b. v,o*ed to CKtfe a di-allcUc 
5 genetic reference standard* 

Figure 2 illustrates the first part of the cell line construction process. There is wustrated a 
plasmid 14 containing the pBluescript sequences 2, a left targeting arm 3 and a right 
targeting arm 6. and an antibiotic resistance marker 15 with appropriate promoters. The 
« plasmid also has the mutant LoxP sites 7 and 8. The first human genetic reference 
sequence which will make up one of the alleles is illustrated as 16. 



MS 



20 



Host DT40 cells to be transformed in this first step are represent* 
native chicken DNA alleles 12 and 13 illustrated at the cloning site. 



Following recombination after introduction of the plasmid 14 into the host ceils n, mree 
cell types may be present Cell type 17 represents a hemizygote containing the integrated 
human genetic reference sequence 16 and die antibiotic resistance marker 15 flanked by 
the two mutant LoxP sites 7 and 8. There may also be cells IS homozygotio for the 
human genetic reference sequence 16, the antibiotic resistance marker 15 and the two 
mutant LoxP sites 7 and 8. Finally, there will be a population of cells 1 9 that has not been 



tranSi 



r«iwi i 
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The untransfotmed cells 19 may be eliminated by selection with the appropriate antibiotic 
leaving a mixed population of hentizy gotes 17 and homozygotes 18. There may also be 
some cells present (not illustrated) containing additional copies of plasmid-derived 
genetic material by random integration (i.e. not at the target site). Clonal snb populations 
from this mixed culture may be readily produced by the skilled addressee by, fox example, 
the use of a dilution technique or flow cytometry assisted cell sorting. (If it is desired to 
use flow cytometry to produce a clonal sub-population, then the gene for green 
fluorescent protein - or a similar fluorescent protein - may be conveniently attached to 
one end of -me targeting construct, so that it is retained, and expressed, in random 
integra nts, but eliminated from targeted integeants, so allowing the separation of the 
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desired cells by Fluorescence Assisted Ceil Sorting). These clonal cell lines may then be 
screened using eg. PCR and Southern Blotting to choose the ii&e 17 that is hemizygous 
for the human genetic reference sequence 16. 
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This hemizygous cell line 17 is then used as the host for the second stage of the 
procedure, which is illustrated in Figure 3. A second plasmid 20 is used in this stage of 
the process. This again may contain the pBhiescript elements 2, the left and right 
targeting arms 3 and 6, and the unique restriction site 9. This second plasmid 20 also 
contains a second antibiotic resistance marker 21, distinct from the first marker 15 
illustrated in figure 2, Thx$ second marker 21 may again be flanked by the mutant LoxP 
sites 7 and 8> Included in this second plasmid 20 is the second human genetic reference 
sequence 22. This could be identical to that used in the first stage to create a homozygous 
cell line, or could be the human genetic reference sequence without the SNP to create a 
heterozygous standard 

Using the hemizygous cell line 17 as the starting host, recombination may be performed 
using this second plasmid 20 as before. Predominant cell types resulting from this will be 
heten>zygote$ 23 containing both antibiotic resistance markers 15, 21 and both human 
genetic reference sequences 16 and 22, There may also be some cell types that are 
homozygous 24 for the second marker 21 and sequence 22 where these sequences have 
replaced those inserted in the first stage. There will also be cells that are hemizygotic 17, 
i.e. where no recombination has occurred in this second stage. The heterosygotic cells 23 
may be selected by the use of both antibiotic selection markets. The resistance markers 
15 and 21 may then be removed from these hetenftjygotic cells 23 by the use of Cre 
Recombinase to produce the genetic reference standard cells 24 containing just the two 
reference sequences and the non-mutant LoxP sites 25* 



The invention is described in the claims that follow, in which the term "human genetic 
reference sequence' 1 comprises a human DNA sequence containing at least one genetic 



30 variant whose presence in the DNA of a human subject is indicative of a pathological 












t, a predisposition to a pathological condition, or a predisposition to an adverse 
reaction to external stimuli. The said genetic variant comprises a change, insertion or 
deletion of one or more bases with respect to the most common sequence in a human 
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population, and includes single nucleotide polymorphisms (SNP), mutations, base or 
sequence insertions, base or sequence deletions and a change in tandem repeat length. The 
reference sequence is characterised in that its total length is at least 35 bases, and does not 
exceed 30 kilobases. For some applications, the minimum length of the reference 
sequence may need to be more than 1 00 bases, to allow detection in an assay in which the 
reference is to be used. A length of 500 bases will be sufficient for almost all 
applications, but the skilled addressee may readily determine an appropriate length by 
routine experiment, and without further inventive thought. 
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Claims 



1 . A genetic reference standard comprising at least one human genetic reference sequence 
cloned into a non-toamtnalian animal cell line. 



2. Tbe genetic reference standard of claim 1 therein the animal cell line is an avian cell 
line. 



3. The genetic reference standard of claim 2 wherein the cell line is a chicken (GalluS 
10 gallus) cell line. 

4. The genetic reference material of claim 3 wherein the chicken cell line is the chicken 
DT40 ceU line. 

13 5. A genetic reference standard according to any preceding claim wherein the » least one 
human genetic reference sequence is cloned into a dispensable region of the cell's 

genome. 



6. A genetic reference standard according to any preceding claim wherein the at least one 
20 human genetic Teference sequence is cloned into a non-expressed region of me cell's 
genome. 



7. A genetic reference standard according to any preceding claim wherein the cloned cell 
line is diploid with respect to the human genetic reference sequence. 



25 



S . A genetic reference standard according to any preceding claim wherein the at least one 
- «n genetic reference sequence is a plurality of human genetic reference sequences, 



HM.H 



&. A genetic reference standard according to any preceding claim wherein the or each 
30 hnman genetic reference sequence is not a functional chromosome. 



10. A method of detecting a genetic variant in a ssmple containing human DNA 
comprising: 
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performing atest, responsive to DNA sequence, on said sample; 

performing the same test on a referenee sample embodying the genetic variant to be 

detected; 

comparing the test results obtained from said sample and said reference sample to 
determine the presence or absence of said genetic variant; 

characterised in mat said refer*** sample is a genetic reference standard according to 
any preceding claim* 
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ABSTRACT 

Genetic Reference Materials 

5 The invention provides a genetic reference standard comprising at least one human 
genetic reference sequence (comprising a human DNA sequence containing at least one 
genetic variant whose presence in the DNA of a human, subject is indicative of a 
pathological condition, a predisposition to a pathological condition, or a predisposition to 
an adverse reaction to external stimuli) cloned into a non-mammalian animal cell line, 
to There is also provided such reference standards where the human DNA is targeted to 
specific location in the host genome* using homologous recombination. The invention 
further provides a method of detecting a genetic variant using such reference standards. 

15 Figure 1 should accompany the abstract. 
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